Are Initial/Final Units Acoustically Accurate?

Authors

  • Pascale Fung
  • Kwok Leung
Abstract

We present a comparative study of subword unit segmentation of Mandarin speech data. Most HMM recognition systems use initials/finals as subword units for Mandarin speech. We find that such a division of monosyllable data into initial/final units is not always supported by acoustic evidence. We implement a delta-MFCC-based segmentation method and compare its output with that of Viterbi segmentation based on initial/final units. We found that whenever the two outputs differ, the acoustic-based method outperforms the initial/final-based method. This raises the question of whether we should use initials/finals as subword units for training HMMs in large-vocabulary Mandarin speech recognition.
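As a rough illustration of the idea of segmenting on delta MFCCs, the sketch below places a single boundary inside a monosyllable at the frame where the delta features change fastest. The abstract does not describe the authors' actual procedure, so everything here is an assumption for illustration: the regression-style delta formula, the Euclidean norm as the change measure, the single-boundary rule, and the hypothetical function names `delta` and `find_boundary`. The input is assumed to be an already computed array of per-frame MFCC vectors.

```python
import numpy as np


def delta(features, width=2):
    """Regression-style delta over +/- `width` frames (assumed formula).

    `features` has shape (num_frames, num_coeffs). Edges are padded by
    repeating the first and last frame.
    """
    features = np.asarray(features, dtype=float)
    num_frames = features.shape[0]
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(features)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + num_frames]
        behind = padded[width - n : width - n + num_frames]
        out += n * (ahead - behind)
    return out / denom


def find_boundary(features, width=2):
    """Return the frame index with the largest delta magnitude.

    This is a hypothetical single-boundary rule, not the paper's method.
    """
    change = np.linalg.norm(delta(features, width), axis=1)
    return int(np.argmax(change))


if __name__ == "__main__":
    # Synthetic "syllable": 10 steady frames, then 10 different steady frames.
    frames = np.vstack([np.zeros((10, 3)), np.ones((10, 3))])
    print("estimated boundary frame:", find_boundary(frames))
```

A boundary found this way could then be compared, frame by frame, with the initial/final boundary produced by a Viterbi forced alignment, which is the kind of comparison the abstract describes.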


Similar resources

Automatic segmentation and clustering of speech using sparse coding and metaheuristic search

We propose a constrained shift and scale invariant sparse coding model for the purpose of unsupervised segmentation and clustering of speech into acoustically relevant sub-word units for automatic speech recognition. We introduce a novel local search algorithm that iteratively improves the acoustic relevance of the automatically-determined sub-word units from a random initialization by repeated...


Accuracy of perceptually based and acoustically based inspiratory loci in reading.

Investigations of speech often involve the identification of inspiratory loci in continuous recordings of speech. The present study investigates the accuracy of perceptually determined and acoustically determined inspiratory loci. While wearing a circumferentially vented mask connected to a pneumotach, 16 participants read two passages. The perceptually determined and acoustically determined in...


Mapping from sound to meaning: reduced lexical activation in Broca's aphasics.

Recent studies of lexical access in Broca's aphasics suggest that lexical activation levels are reduced in these patients. The present study compared the performance of Broca's aphasics with that of normal subjects in an auditory semantic priming paradigm. Lexical decision times were measured in response to word targets preceded by an intact semantically related prime word ("cat"-"dog"), by a r...


Destruction of Recombinant Tissue Plasminogen Activator (rtPA) -Loaded Echogenic Liposomes under Dual Frequency Sonication

Background: Echogenic liposomes (ELIPs) encapsulate drugs and gas bubbles within lipid vesicles. The destruction of ELIPs in response to MHz and kHz ultrasound waves has been studied previously. Applying ultrasound above a certain threshold causes encapsulated gas bubbles to destruct rapidly by fragmentation or more slowly by acoustically driven diffusion. This study compares the d...


Speech recognition based on acoustically derived segment units

This paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is effective when it is difficult to map acoustics to a phone such as with...




Publication date: 2007